Main
Sana’ Zamil
I have made visualizations viewed by hundreds of thousands of people, sped up query times for 25 terabytes of data by an average of 4,800 times, and built packages for R that let you do magic.
Currently searching for a data science position that allows me to build tools using visualization and machine learning to help people explore and understand their data.
Education
Diploma: Palestinian Studies
Online
Academy of Refugee Studies
May-19 - Sep-18
B.S., Architecture Engineering & Building Systems
Amman, JO
Middle East Univercity (MEU)
Jun-15 - Feb-12
Diploma: Architecture Engineering
Amman, JO
Wadi AL-Sir Training Center (W.S.T.C)
Jun-11 - Oct-09
M.S Program Evaluation and Data Analytics
Online
Arizona State Univercity (ASU)
Dec-22 - Jan-21
Work Experience
Urban Planner
Amman, JO
Ministry Of Local Administration
Now - Apr-16
- Creating Comprehensive Plans and Designs for Land Use, Zoning, Buildings Types and Transportation across Different Areas.
- Updating and Correcting Urban Plans, Blueprints and Drawing Using GIS, MicroStation and AutoCAD Programs.
- Gathering and Analysing Data for Studies and Reports and Made Recommendations Based on Findings.
IFormBuilder Developer
Amman, JO
Caritas
Apr-16 - Oct-15
- Developed an IFormBuilder Beneficiary Vulnerability & Needs Assessment Form.
- Conducted End-User Trainings.
- Collaborated with IFormBuilder Helpdesk and IT Consultants to Resolve problems.
Intern / Monitoring & Evaluation
Amman, JO
Ruwwad
Apr-16 - Apr-15
- Made a Manual about Detailed Work of Ruwwad.
- Monitored Programs Evaluation Tools (Excel).
- Developed an Internal System with the Department.
Industry Experience
I have worked in a variety of roles ranging from journalist to software engineer to data scientist. I like collaborative environments where I can learn from my peers.
Data Journalist - Graphics Department
New York, New York
New York Times
2016
- Reporter with the graphics desk covering topics in science, politics, and sport.
- Work primarily done in R, Javascript, and Adobe Illustrator.
Engineering Intern - User Experience
Burlington, VT
Dealer.com
2015
- Built internal tool to help analyze and visualize user interaction with back-end products.
Data Science Intern
Burlington, VT
Dealer.com
2015
- Worked with the product analytics team to help parse and visualize large stores of data to drive business decisions.
Data Artist In Residence
Carpinteria, CA
Conduce
2015 - 2014
- Envisioned, prototyped and implemented visualization framework in the course of one month.
- Constructed training protocol for bringing third parties up to speed with new protocol.
Software Engineering Intern
Carpinteria, CA
Conduce
2014
- Incorporated d3.js to the company’s main software platform.
Teaching Experience
I am passionate about education. I believe that no topic is too complex if the teacher is empathetic and willing to think about new methods of approaching task.
Data Visualization Best Practices
N/A
DataCamp
2019
- Designed from bottom up course to teach best practices for scientific visualizations.
- Uses R and ggplot2.
- In top 10% on platform by popularity.
Improving your visualization in Python
N/A
DataCamp
2019
- Designed from bottom up course to teach advanced methods for enhancing visualization.
- Uses python, matplotlib, and seaborn.
Advanced Statistical Learning and Inference
Nashville, TN
Vanderbilt Biostatistics Department
2018 - 2017
- TA and lectured
- Topics covered from penalized regression to boosted trees and neural networks
- Highest level course offered in department
Advanced Statistical Computing
Nashville, TN
Vanderbilt Biostatistics Department
2018
- TA and lectured
- Covered modern statistical computing algorithms
- 4th year PhD level class
Statistical Computing in R
Nashville, TN
Vanderbilt Biostatistics Department
2017
- TA and lectured
- Covered introduction to R language for statistics applications
- Graduate level class
Selected Data Science Writing
I regularly blog about data science and visualization on my blog LiveFreeOrDichotomize.
Using AWK and R to Parse 25tb
N/A
LiveFreeOrDichotomize.com
2019
- Story of parsing large amounts of genomics data.
- Provided advice for dealing with data much larger than disk.
- Reached top of HackerNews.
Classifying physical activity from smartphone data
N/A
RStudio Tensorflow Blog
2018
- Walk through of training a convolutional neural network to achieve state of the art recognition of activities from accelerometer data.
- Contracted article.
The United States of Seasons
N/A
LiveFreeOrDichotomize.com
2018
- GIS analysis of weather data to find the most ‘seasonal’ locations in United States
- Used Bayesian regression methods for smoothing sparse geospatial data.
A year as told by fitbit
N/A
LiveFreeOrDichotomize.com
2017
- Analyzing a full years worth of second-level heart rate data from wearable device.
- Demonstrated visualization-based inference for large data.
MCMC and the case of the spilled seeds
N/A
LiveFreeOrDichotomize.com
2017
- Full Bayesian MCMC sampler running in your browser.
- Coded from scratch in vanilla Javascript.
The Traveling Metallurgist
N/A
LiveFreeOrDichotomize.com
2017
- Pure javascript implementation of traveling salesman solution using simulated annealing.
- Allows reader to customize the number and location of cities to attempt to trick the algorithm.
Selected Press (About)
Great paper? Swipe right on the new â€<U+06A9>Tinder for preprints’ app
N/A
Science
2017
- Story of the app Papr made with Jeff Leek and Lucy D’Agostino McGowan.
Swipe right for science: Papr app is â€<U+06A9>Tinder for preprints’
N/A
Nature News
2017
- Second press article for app Papr.
The Deeper Story in the Data
N/A
University of Vermont Quarterly
2016
- Story on my path post graduation and the power of narrative.
Selected Press (By)
The Great Student Migration
N/A
The New York Times
2016
- Most shared and discussed article from the New York Times for August 2016.
Wildfires are Getting Worse, The New York Times
N/A
The New York Times
2016
- GIS analysis and modeling of fire patterns and trends
- Data in collaboration with NASA and USGS
Who’s Speaking at the Democratic National Convention?
N/A
The New York Times
2016
- Data scraped from CSPAN records to figure out who talked and past conventions.
Who’s Speaking at the Republican National Convention?
N/A
The New York Times
2016
- Used same data scraping techniques as Who’s Speaking at the Democratic National Convention?
A Trail of Terror in Nice, Block by Block
N/A
The New York Times
2016
- Led research effort to put together story of 2016 terrorist attack in Nice, France in less than 12 hours.
- Work won Silver medal at Malofiej 2017, and gold at Society of News and Design.
Selected Publications, Posters, and Talks
Charge Reductions Associated with Shortening Time to Recovery in Septic Shock
N/A
Chest
2019
- Authored with Wesley H. Self, MD MPH; Dandan Liu, PhD; Stephan Russ, MD, MPH; Michael J. Ward, MD, PhD, MBA; Nathan I. Shapiro, MD, MPH; Todd W. Rice, MD, MSc; Matthew W. Semler, MD, MSc.
Multimorbidity Explorer | A shiny app for exploring EHR and biobank data
N/A
RStudio::conf 2019
2019
- Contributed Poster. Authored with Yaomin Xu.
Taking a network view of EHR and Biobank data to find explainable multivariate patterns
N/A
Vanderbilt Biostatistics Seminar Series
2019
- University wide seminar series.
Patient-specific risk factors independently influence survival in Myelodysplastic Syndromes in an unbiased review of EHR records
N/A
Under-Review (copy available upon request.)
2019
- Bayesian network analysis used to find novel subgroups of patients with Myelodysplastic Syndromes (MDS).
- Analysis done using method built for my dissertation.
Patient specific comorbidities impact overall survival in myelofibrosis
N/A
Under-Review (copy available upon request.)
2019
- Bayesian network analysis used to find robust novel subgroups of patients with given genetic mutations.
- Analysis done using method built for my dissertation.
R timelineViz: Visualizing the distribution of study events in longitudinal studies
N/A
Under-Review (copy available upon request.)
2018
- Authored with Alex Sunderman of the Vanderbilt Department of Epidemiology.
Continuous Classification using Deep Neural Networks
N/A
Vanderbilt Biostatistics Qualification Exam
2017
- Review of methods for classifying continuous data streams using neural networks
- Successfully met qualifying examination standards
Asymmetric Linkage Disequilibrium: Tools for Dissecting Multiallelic LD
N/A
Journal of Human Immunology
2015
- Authored with Richard Single, Vanja Paunic, Mark Albrecht, and Martin Maiers.
An Agent Based Model of Mysis Migration
N/A
International Association of Great Lakes Research Conference
2015
- Authored with Brian O’Malley, Sture Hansson, and Jason Stockwell.
Declines of Mysis diluviana in the Great Lakes
N/A
Journal of Great Lakes Research
2015
- Authored with Peter Euclide and Jason Stockwell.